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SEQUENCE LISTING 

<110> CAMPBELL, ANTHONY KEITH 

<12 0> PROTEIN AND DNA CODING THEREFOR 

<130> WCM.69 .US 

<140> 09/831,142 
<141> 2001-05-07 

<150> PCT/GB99/03654 
<151> 1999-11-05 

<150> GB 9824357.9 
<151> 1998-11-07 

<160> 42 

<170> Patentln Ver. 2.1 

<21Q> 1 
<211> 870 
<212> DNA 

<213> Pholas dactylus 

<220> 

<221> CDS 

<222> (30) . . (704) 

<400> 1 

gaattcggca cgagtcggaa aagaacaaa atg get tgt ate gtt ttc gtt get 

Met Ala Cys He Val Phe Val Ala 
1 5 



ctt gtc get eta tgc tta atg caa ccg ggt tec ggt gag gaa gta caa 
Leu Val Ala Leu Cys Leu Met Gin Pro Gly Ser Gly Glu Glu Val Gin 
10 15 20 

tgc gcg atg aat tgg aca caa get aat gaa tat gtg ttc aac gtg gac 
Cys Ala Met Asn Trp Thr Gin Ala Asn Glu Tyr Val Phe Asn Val Asp 
25 30 35 40 

tgg atg acc att ttc ate tac gac tat ggc get caa gag caa ctg tac 
Trp Met Thr He Phe He Tyr Asp Tyr Gly Ala Gin Glu Gin Leu Tyr 
45 50 55 

gaa gat egg get ttg ggg ctg tgt egg att gaa egg gee ggc cca ggt 
Glu Asp Arg Ala Leu Gly Leu Cys Arg He Glu Arg Ala Gly Pro Gly 
60 65 70 

acc aca aaa gec gtc tgg att aac tgg agt aac gac acg cag tea tgt 
Thr Thr Lys Ala Val Trp lie Asn Trp Ser Asn Asp Thr Gin Ser Cys 
75 80 85 

gta aca aga aaa aca ate ttc ttc gag gtt ggt gga gaa att gee egg 
Val Thr Arg Lys Thr He Phe Phe Glu Val Gly Gly Glu He Ala Arg 
90 95 100 



eta gtt gac tac aga cca cag gaa gac gga act gag aaa act ttt aca 3 89 
Leu Val Asp Tyr Arg Pro Gin Glu Asp Gly Thr Glu Lys Thr Phe Thr 
105 110 115 120 



aga aaa ttc tct age aaa atg cca ggc act tac atg ctt atg gac gtg 437 
Arg Lys Phe Ser Ser Lys Met Pro Gly Thr Tyr Met Leu Met Asp Val 
125 130 135 

tgc get aca agg gac get gat gat aaa tgc ate gaa ggc aca att gtg 485 
Cys Ala Thr Arg Asp Ala Asp Asp Lys Cys lie Glu Gly Thr He Val 
140 145 150 

gtg aca gtc agg gtg tec eta tat gac gaa gat aac aat ggt gta atg 533 
Val Thr Val Arg Val Ser Leu Tyr Asp Glu Asp Asn Asn Gly Val Met 
155 160 165 

gat gaa ggt aag gtg att cca tct gag aca ate gag gat gat ate aag 581 
Asp Glu Gly Lys Val He Pro Ser Glu Thr He Glu Asp Asp He Lys 
170 175 180 

gac tgt ggg etc tta gac caa gat gtt gaa etc gat tat acg tgg act 629 
Asp Cys Gly Leu Leu Asp Gin Asp Val Glu Leu Asp Tyr Thr Trp Thr 
185 190 195 200 

caa aac gag tgt gat eta cca gac aca gta gac gag get gaa gac aca 677 
Gin Asn Glu Cys Asp Leu Pro Asp Thr Val Asp Glu Ala Glu Asp Thr 
205 210 215 

ccg tea gaa act gga gaa ttc ttc tgg tagatctatc agactacttt 724 
Pro Ser Glu Thr Gly Glu Phe Phe Trp 
220 225 

tatcagcagg acaactggtc gttaccagac acctataacg tgtcctcatc aataatgtgt 784 

aaaacagaaa taatcgatag aatattgaaa ataaaatgtt aataaacact ggttgaaata 844 

tgaaaaaaaa aaaaaaaaaa ctcgag 870 

<210> 2 
<211> 816 
<212> DNA 

<213> Pholas dactylus 
<400> 2 

gaatteggea cgagggaaaa gaacaaaatg gcttgtatcg ttttcgttgc tettgtcget 60 
etatgettaa tgcaaccggg ttccggtgag gaagtacaat gcgcgatgaa ttggacacaa 12 0 
gctaatgaat atgtgttcaa cgtggactgg atgaccattt tcatctacga etatggeget 180 
caagagcaac tgtacgagga tegggctttg gggctgtgtc ggattgaacg ggccggccca 24 0 
ggtaccacaa aagccgtctg gattaactgg agtaacgaca cgcagtcatg tgtaacaaga 300 
aaaacaatct tcttcgaggt tggtggagaa attgcccggc tagttgacta cagaccacag 360 
gaagacggaa ctgagaaaac ttttacaaga aaattctcta geaaaatgee aggcacttac 420 
atgcttatgg acgtgtgcgc tacaagggac gctgatgata aatgeatega aggcacaatt 480 
gtggtgacag tcagggtgtc cctatatgac gaagataaca atggtgtaat ggatgaaggt 54 0 
aaggttattc catctgagac aatcgaggat gatatcaagg actgtgggct cttagaccaa 60 0 
gatgttgaac tcgattatac gtggactcaa aacgagtgtg atctaccaga cacagtagac 660 
gaggctgaag acacaccgtc agaaactgga gaattcttct ggtagatcta tcagaccact 72 0 
tttatcagca ggacaactgg tcgttaccag acacctataa cgtgtcctca tcaataatgt 780 



gtaaaacaga aataatcgat agaatattga aaataa 



816 



<210> 3 
<211> 852 
<212> DNA 

<213> Pholas dactylus 
<400> 3 

gtcggaaaag aacaaaatgg cttgtatcgt tttcgttgct cttgtcgctc tatgcttaat 60 
gcaaccgggt tccggtgagg aagtacaatg cgcgatgaat tggacacaag ctaatgaata 12 0 
tgtgttcaac gtggactgga tgaccatttt catctacgac tatggcgctc aagagcaact 180 
gtacgaggat cgggctttgg ggctgtgtcg gattgaacgg gccggcccag gtaccacaaa 240 
agccgtctgg attaactgga gtaacgacac gcagtcatgt gtaacaagaa aaacaatctt 300 
cttcgaggtt ggtggagaaa ttgcccggct agttgactac agaccacagg aagacggaac 3 60 
tgagaaaact tttacaagaa aattctctag caaaatgcca ggcacttaca tgcttatgga 42 0 
cgtgtgcgct acaagggacg ctgatgataa atgcatcgaa ggcacaattg tggtgacagt 4 80 
cagggtgtcc ctatatgacg aagataacaa tggtgtaatg gatgaaggta aggttattcc 540 
atctgagaca atcgaggatg atatcaagga ctgtgggctc ttagaccaag atgttgaact 600 
cgattatacg tggactcaaa acgagtgtga tctaccagac acagtagacg aggctgaaga 660 
cacaccgtca gaaactggag aattcttctg gtagatctat cagaccactt ttatcagcag 720 
gacaactggt cgttaccaga cacctataac gtgtcctcat caataatgtg taaaacagaa 780 
ataatcgata gaatattgaa aataaaatgt taatagacac tggttgaaaa aaaaaaaaaa 840 
aaaaaactcg ag 852 



<210> 4 
<211> 225 
<212> PRT 

<213> Pholas dactylus 
<400> 4 

Met Ala Cys lie Val Phe Val Ala Leu Val Ala Leu Cys Leu Met Gin 
15 10 15 

Pro Gly Ser Gly Glu Glu Val Gin Cys Ala Met Asn Trp Thr Gin Ala 
20 25 30 

Asn Glu Tyr Val Phe Asn Val Asp Trp Met Thr lie Phe lie Tyr Asp 
35 40 45 

Tyr Gly Ala Gin Glu Gin Leu Tyr Glu Asp Arg Ala Leu Gly Leu Cys 
50 55 60 

Arg lie Glu Arg Ala Gly Pro Gly Thr Thr Lys Ala Val Trp lie Asn 
65 70 75 80 

Trp Ser Asn Asp Thr Gin Ser Cys Val Thr Arg Lys Thr lie Phe Phe 
85 90 95 

Glu Val Gly Gly Glu lie Ala Arg Leu Val Asp Tyr Arg Pro Gin Glu 
100 105 110 

Asp Gly Thr Glu Lys Thr Phe Thr Arg Lys Phe Ser Ser Lys Met Pro 
115 120 125 



Gly Thr Tyr Met Leu Met Asp Val Cys Ala Thr Arg Asp Ala Asp Asp 
130 135 140 



Lys Cys lie Glu 
145 

Asp Glu Asp Asn 



Glu Thr He Glu 
180 

Val Glu Leu Asp 
195 

Thr Val Asp Glu 
210 

Trp 
225 



Gly Thr He Val 
150 

Asn Gly Val Met 
165 

Asp Asp He Lys 



Tyr Thr Trp Thr 
200 

Ala Glu Asp Thr 
215 



Val Thr Val Arg 
155 

Asp Glu Gly Lys 
170 

Asp Cys Gly Leu 
185 

Gin Asn Glu Cys 



Pro Ser Glu Thr 
220 



Val Ser Leu Tyr 
160 

Val He Pro Ser 
175 

Leu Asp Gin Asp 
190 

Asp Leu Pro Asp 
205 

Gly Glu Phe Phe 



<210> 5 
<211> 205 
<212> PRT 

<213> Pholas dactylus 
<400> 5 

Glu Glu Val Gin Cys Ala Met Asn Trp Thr Gin Ala Asn Glu Tyr Val 
15 10 15 

Phe Asn Val Asp Trp Met Thr He Phe He Tyr Asp Tyr Gly Ala Gin 
20 25 30 

Glu Gin Leu Tyr Glu Asp Arg Ala Leu Gly Leu Cys Arg He Glu Arg 
35 40 45 

Ala Gly Pro Gly Thr Thr Lys Ala Val Trp He Asn Trp Ser Asn Asp 
50 55 60 

Thr Gin Ser Cys Val Thr Arg Lys Thr He Phe Phe Glu Val Gly Gly 
65 70 75 80 

Glu He Ala Arg Leu Val Asp Tyr Arg Pro Gin Glu Asp Gly Thr Glu 
85 90 95 

Lys Thr Phe Thr Arg Lys Phe Ser Ser Lys Met Pro Gly Thr Tyr Met 
100 105 no 

Leu Met Asp Val Cys Ala Thr Arg Asp Ala Asp Asp Lys Cys He Glu 
115 120 125 

Gly Thr He Val Val Thr Val Arg Val Ser Leu Tyr Asp Glu Asp Asn 
130 135 140 

Asn Gly Val Met Asp Glu Gly Lys Val He Pro Ser Glu Thr He Glu 
145 150 155 160 

Asp Asp He Lys Asp Cys Gly Leu Leu Asp Gin Asp Val Glu Leu Asp 
165 170 175 



5 



Tyr Thr Trp Thr Gin Asn Glu Cys Asp Leu Pro Asp Thr Val Asp Glu 
180 185 190 

Ala Glu Asp Thr Pro Ser Glu Thr Gly Glu Phe Phe Trp 
195 200 205 



<210> 6 
<211> 225 
<212> PRT 

<213> Pholas dactylus 
<400> 6 

Met Ala Cys lie Val Phe Val Ala Leu Val Ala Leu Cys Leu Met Gin 
15 10 15 

Pro Gly Ser Gly Glu Glu Val Gin Cys Ala Met Asn Trp Thr Gin Ala 
M 20 25 30 

C3 Asn Glu Tyr Val Phe Asn Val Asp Trp Met Thr lie Phe lie Tyr Asp 

IjJ 35 4 0 4 5 

Tyr Gly Ala Gin Glu Gin Leu Tyr Glu Asp Arg Ala Leu Gly Leu Cys 

.r= 50 55 60 

m 

Arg lie Glu Arg Ala Gly Pro Gly Thr Thr Lys Ala Val Trp lie Asn 

L 65 70 75 80 

P 

ijj Trp Ser Asn Asp Thr Gin Ser Cys Val Thr Arg Lys Thr lie Phe Phe 

P 85 90 95 

p Glu Val Gly Gly Glu lie Ala Arg Leu Val Asp Tyr Arg Pro Gin Glu 

j** 100 105 110 

Asp Gly Thr Glu Lys Thr Phe Thr Arg Lys Phe Ser Ser Lys Met Pro 
115 120 125 

Gly Thr Tyr Met Leu Met Asp Val Cys Ala Thr Arg Asp Ala Asp Asp 
130 135 140 

Lys Cys lie Glu Gly Thr lie Val Val Thr Val Arg Val Ser Leu Tyr 
145 150 155 160 

Asp Glu Asp Asn Asn Gly Val Met Asp Glu Gly Lys Val lie Pro Ser 
165 170 175 

Glu Thr lie Glu Asp Asp lie Lys Asp Cys Gly Leu Leu Asp Gin Asp 
180 185 190 

Val Glu Leu Asp Tyr Thr Trp Thr Gin Asn Glu Cys Asp Leu Pro Asp 
195 200 205 

Thr Val Asp Glu Ala Glu Asp Thr Pro Ser Glu Thr Gly Glu Phe Phe 
210 215 220 



Trp 
225 



<210> 7 
<211> 17 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<220> 

<221> modif ied_base 
<222> (3) 
<223> i 

<400> 7 

acnathttyt tycargt 



<210> 8 
<211> 17 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<220> 

<221> modif ied_base 

<222> (12) 

<223> A, T, C or G 

<220> 

<221> modif ied_base 
<222> (15) 
<223> i 

<400> 8 

cargargarg gnacnga 



<210> 9 
<211> 17 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<220> 

<221> modif ied_base 
<222> (3) 
<223> i 



<220> 

<221> modif ied_base 
<222> (6) 

<223> A, T, C or G 
<400> 9 

tcngtnccyt cytcytg 



<210> 10 
<211> 18 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<220> 

<221> modif ied_base 
<222> (9) 
<223> i 

<400> 10 

ttyaaygtng aytggatg 

<210> 11 
<211> 20 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<:400> 11 

acacagcccc aaagcccgat 

<210> 12 
<211> 20 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 12 

ttgcccggct agttgactac 

<210> 13 
<211> 24 
<212> DNA 

<213> Artificial sequence 



8 



<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 13 

catatttcaa ccagtgttta ttaa 24 



<210> 14 
<211> 19 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

p <40 0> 14 

dl caattgtgcc ttcgatgca 19 

m 

lJ <210> 15 

T <211> 17 

^1 <212> DNA 

^ <213> Artificial sequence 

l ; : J 

<220> 

£3 <223> Description of Artificial Sequence: Synthetic 
Iff oligonucleotide 

o 

Sj <400> 15 

^ ggactgtggg ctcttag 17 

<210> 16 
<211> 20 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 16 

atggcttgta tcgttttcgt 2 0 



<210> 17 

<211> 27 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 



9 




<210> 22 
<211> 20 



10 



<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 22 

caacaacgca cagaatctag 2 0 



<210> 23 

<211> 726 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Clone 40 
amplified by rTth DNA pol XL 

<220> 

<221> modif ied_base 

<222> (644) 

<223> A, T, C, G, other or unknown 



<400> 23 

tatgcttaat gcaaccgggt tccggtgagg aagtacaatg cgcgatgaat tggacacaag 60 
ctaatgaata tgtgttcaac gtggactgga tgaccatttt catctacgac tatggcgctc 12 0 
aagagcaact gtacgaagat cgggctttgg ggctgtgtcg gattgaacgg gccggcccag 18 0 
gtaccacaaa agccgtctgg attaactgga gtaacgacac gcagtcatgt gtaacaagaa 24 0 
aaacaatctt cttcgaggtt ggtggagaaa ttgcccggct agttgactac agaccacagg 300 
aagacggaac tgagaaaact tttacaagaa aattctctag caaaatgcca ggcacttaca 3 60 
tgcttatgga cgtgtgcgct acaagggacg ctgatgataa atgcatcgaa ggcacaattg 42 0 
tggtgacagt cagggtgtcc ctatatgacg aagataacaa tggtgtaatg gatgaaggta 4 80 
aggtgattcc atctgagaca atcgaggatg atatcaagga ctgtgggctc ttagaccaag 54 0 
atgttgaact cgattatacg tggactcaaa acgagtgtga tctaccagac acagtagacg 60 0 
aggctgaaga cacaccgtca gaaactggag aattcttctg gtanatctat cagactactt 660 
ttatcagcag gacaactggt cgttaccaga cacctataac gtgtcctcat caataatgtg 72 0 
taaaac 726 



<210> 24 
<211> 34 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 24 

Asn Leu Arg Asp Glu Asp Asn Asn Leu Leu Asp Glu Asn Gly Asp Leu 
15 10 15 

Leu Pro Leu Glu Ser Leu Glu Leu Asp Gin Asp Val Glu Leu Asp Tyr 
20 25 30 



Thr Trp 




11 

<212> PRT 

<213> Cyprinus carpio 
<400> 25 

lie Met Gin Lys Gly Glu Leu Val Pro Leu Asp Thr Val Leu Asp Met 
15 10 15 

lie Lys Asp Ala Met lie Ala Lys Ala Asp Val Ser Lys Gly Tyr 
20 25 30 



03 



P 



<210> 26 
<211> 20 
<212> PRT 

<213> Synechocystis sp. 
<400> 26 

Asp Gin Val Gin Ser Leu Met Arg Phe Ser Gin Ser Lys Gin lie lie 
15 10 15 

Phe Asn Phe Asp 
20 



M= <210> 27 



<211> 14 
<212> PRT 

<213> Emericella nidulans 
<400> 27 

lie Met Cys Ser Val Asp Trp Thr Arg Arg Asn Arg Phe lie 
15 10 



<210> 28 
<211> 14 
<212> PRT 

<213> Drosophila melanogaster 
<400> 28 

Pro Asp Thr Tyr Asp Glu Glu Glu Asp Thr Tyr Thr His Thr 
15 10 



<210> 29 
<211> 13 
<212> PRT 

<213 > Peptococcus niger 
<400> 29 

Asp Pro lie Asp Glu Ala Gly Glu Val Pro Ser Glu Thr 
15 10 



<210> 30 
<211> 25 
<212> PRT 
<213> Homo 



sapiens 



12 



<400> 30 

Asp Asp Asp Gly lie Gly Tyr Val Glu Asp Gly Arg Glu He Phe Asp 
15 10 15 

Asp Asp Leu Glu Asp Asp Ala Leu Asp 
20 25 



<210> 31 

<211> 59 

<212> PRT 

<213> Vargula sp . 

<400> 31 

Tyr Trp Asn Thr Trp Asp Val Lys Val Ser Leu Arg Asp Val Glu Ser 
15 10 15 

Tyr Thr Glu Val Glu Lys Val Thr He Arg Lys Gin Ser Thr Val Val 
20 25 30 

Asp Leu He Val Asp Gly Lys Gin Val Lys Val Gly Gly Val Asp Val 
35 40 45 

Ser He Pro Tyr Ser Ser Glu Asn Thr Ser He 
50 55 



<210> 32 

<211> 62 

<212> PRT 

<213> Renilla sp . 

<400> 32 

Ala He Lys He Ala Lys Leu Ser Ala Glu Lys Ala Glu Glu Thr Arg 
15 10 15 

Gly Phe Leu Arg Val Ala Asp Gin Leu Gly Leu Ala Pro Gly Val Arg 
20 25 30 

He Ser Val Glu Glu Ala Ala Val Asn Ala Thr Asp Ser Leu Leu Lys 
35 40 45 

Met Lys Ala Glu Glu Lys Ala Met Ala Val He Gin Ser Leu 
50 55 60 



<210> 33 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Illustrative 
P-loop binding motif 



13 



<400> 33 

Ala Ala Ala Ala Gly Lys Thr 
1 5 



<210> 34 
<211> 4 
<212> PRT 

<213> Photinus pyralis 

<400> 34 
His His Gly Phe 
1 



<210> 35 
<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
illustrative peptide 

<400> 35 

Met Leu Ser Arg Leu Ser Leu Arg Leu Leu Ser Arg Tyr Leu Leu 
15 10 15 



<210> 36 
<211> 19 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
illustrative peptide 

<400> 36 

Lys Lys Ser Ala Leu Leu Ala Leu Met Tyr Val Cys Pro Gly Lys Ala 
15 10 15 

Asp Lys Glu 



<210> 37 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
illustrative peptide 

<400> 37 

Met Leu Leu Pro Val Pro Leu Leu Leu Gly Leu Leu Gly Leu Ala Ala 
15 10 15 



• 

14 



<210> 38 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
illustrative peptide 

<400> 38 
Lys Asp Glu Leu 
1 



<210> 39 
<211> 4 
<212> PRT 
O <213> Artificial Sequence 

yj 

03 <220> 

|ij <223> Description of Artificial Sequence: Synthetic 

m illustrative peptide 

% <400> 39 

j-jl His Asp Glu Leu 

P 

Ul <210> 40 

D <211> 4 

%J <212> PRT 

p <213> Artificial Sequence 

U 

<220> 

<223> Description of Artificial Sequence: Synthetic 
illustrative peptide 

<400> 40 
Lys Glu Glu Leu 
1 



<210> 41 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
illustrative peptide 

<400> 41 

Pro Lys Lys Lys Arg Lys Val 
1 5 



15 



<210> 42 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Illustrative 
N-terminal acylation motif from Tyrosine kinase 



<400> 42 

Met Gly Cys Val Cys Ser Ser Asn Pro Asp 
15 10 



09/831142 

JC08 RflWPCT/PTO O7MAY200f 



SEQUENCE LISTING 

<110> University of Wales College of Medicine 

<120> Protein and DNA coding therefor 

<130> PCT/GB99/03654 

<140> PCT/GB99/03654 
<141> 1999-11-05 

<160> 22 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 870 
<212> DNA 

<213> Pholas dactylus 
<400> 1 



gaattcggca 


cgagtcggaa 


aagaacaaaa 


tggcttgtat 


cgttttcgtt 


gctcttgtcg 


60 


ctctatgctt 


aatgcaaccg ggttccggtg 


aggaagtaca 


atgcgcgatg 


aattggacac 


120 


aagctaatga 


atatgtgttc 


aacgtggact 


ggatgaccat 


tttcatctac 


gactatggcg 


180 


ctcaagagca 


actgtacgaa 


gatcgggctt 


tggggctgtg 


tcggattgaa 


cgggccggcc 


240 


caggtaccac 


aaaagccgtc 


tggattaact 


ggagtaacga 


cacgcagtca 


tgtgtaacaa 


300 


gaaaaacaat 


cttcttcgag gttggtggag 


aaattgcccg 


gctagttgac 


tacagaccac 


360 


aggaagacgg 


aactgagaaa 


acttttacaa 


gaaaattctc 


tagcaaaatg 


ccaggcactt 


420 


acatgcttat 


ggacgtgtgc 


gctacaaggg 


acgctgatga 


taaatgcatc 


gaaggcacaa 


480 


ttgtggtgac 


agtcagggtg 


tccctatatg 


acgaagataa 


caatggtgta 


atggatgaag 


540 


gtaaggtgat 


tccatctgag 


acaatcgagg 


atgatatcaa 


ggactgtggg 


ctcttagacc 


600 


aagatgttga 


actcgattat 


acgtggactc 


aaaacgagtg 


tgatctacca 


gacacagtag 


660 


aegaggctga 


agacacaccg 


tcagaaactg 


gagaattctt 


ctggtagatc 


tatcagacta 


720 


cttttatcag 


caggacaact 


ggtcgttacc 


agacacctat 


aacgtgtcct 


catcaataat 


780 


gtgtaaaaca 


gaaataatcg 


atagaatatt 


gaaaataaaa 


tgttaataaa 


cactggttga 


840 



aatatgaaaa aaaaaaaaaa aaaactcgag 



870 



<210> 2 
<211> 816 
<212> DNA 

<213> Pholas dactylus 
<400> 2 

gaattcggca cgagggaaaa gaacaaaatg gcttgtatcg ttttcgttgc tcttgtcgct 60 
ctatgcttaa tgcaaccggg ttccggtgag gaagtacaat gcgcgatgaa ttggacacaa 12 0 
gctaatgaat atgtgttcaa cgtggactgg atgaccattt tcatctacga ctatggcgct 180 
caagagcaac tgtacgagga tcgggctttg gggctgtgtc ggattgaacg ggccggccca 24 0 
ggtaccacaa aagccgtctg gattaactgg agtaacgaca cgcagtcatg tgtaacaaga 3 00 
aaaacaatct tcttcgaggt tggtggagaa attgcccggc tagttgacta cagaccacag 3 60 
gaagacggaa ctgagaaaac ttttacaaga aaattctcta gcaaaatgcc aggcacttac 42 0 
atgcttatgg acgtgtgcgc tacaagggac gctgatgata aatgcatcga aggcacaatt 480 
gtggtgacag tcagggtgtc cctatatgac gaagataaca atggtgtaat ggatgaaggt 54 0 
aaggttattc catctgagac aatcgaggat gatatcaagg actgtgggct cttagaccaa 600 
gatgttgaac tcgattatac gtggactcaa aacgagtgtg atctaccaga cacagtagac 660 
gaggctgaag acacaccgtc agaaactgga gaattcttct ggtagatcta tcagaccact 720 
tttatcagca ggacaactgg tcgttaccag acacctataa cgtgtcctca tcaataatgt 780 
gtaaaacaga aataatcgat agaatattga aaataa 816 

<210> 3 
<211> 852 
<212> DNA 

<213> Pholas dactylus 



<400> 3 

gtcggaaaag aacaaaatgg cttgtatcgt tttcgttgct cttgtcgctc tatgcttaat 



60 



gcaaccgggt 


tccggtgagg 


aagtacaatg 


tgtgttcaac 


gtggactgga 


tgaccatttt 


gtacgaggat 


cgggctttgg 


ggctgtgtcg 


agccgtctgg 


attaactgga 


gtaacgacac 


cttcgaggtt 


ggtggagaaa 


ttgcccggct 


tgagaaaact 


tttacaagaa 


aattctctag 


cgtgtgcgct 


acaagggacg 


ctgatgataa 


cagggtgtcc 


ctatatgacg 


aagataacaa 


atctgagaca 


atcgaggatg 


atatcaagga 


cgattatacg 


tggactcaaa 


acgagtgtga 


cacaccgtca 


gaaactggag 


aattcttctg 


gacaactggt 


cgttaccaga 


cacctataac 


ataatcgata 


gaatattgaa 


aataaaatgt 


aaaaaactcg 


ag 





cgcgatgaat tggacacaag ctaatgaata 12 0 
catctacgac tatggcgctc aagagcaact 180 
gattgaacgg gccggcccag gtaccacaaa 240 
gcagtcatgt gtaacaagaa aaacaatctt 300 
agttgactac agaccacagg aagacggaac 3 60 
caaaatgcca ggcacttaca tgcttatgga 420 
atgcatcgaa ggcacaattg tggtgacagt 480 
tggtgtaatg gatgaaggta aggttattcc 540 
ctgtgggctc ttagaccaag atgttgaact 600 
tctaccagac acagtagacg aggctgaaga 660 
gtagatctat cagaccactt ttatcagcag 720 
gtgtcctcat caataatgtg taaaacagaa 78 0 
taatagacac tggttgaaaa aaaaaaaaaa 840 

852 



<210> 4 
<211> 225 
<212> PRT 

<213> Pholas dactylus 



<400> 4 

Met Ala Cys He Val Phe Val Ala 
1 5 

Pro Gly Ser Gly Glu Glu Val Gin 
20 

Asn Glu Tyr Val Phe Asn Val Asp 
35 40 



Leu Val Ala Leu Cys Leu Met Gin 
10 15 

Cys Ala Met Asn Trp Thr Gin Ala 
25 30 

Trp Met Thr He Phe He Tyr Asp 

45 



Tyr Gly Ala Gin Glu Gin Leu Tyr Glu Asp Arg Ala Leu Gly Leu Cys 
50 55 60 

Arg He Glu Arg Ala Gly Pro Gly Thr Thr Lys Ala Val Trp He Asn 



65 



70 



75 



80 



Trp Ser Asn Asp Thr 

85 

Glu Val Gly Gly Glu 
100 

Asp Gly Thr Glu Lys 
115 

Gly Thr Tyr Met Leu 
130 

Lys Cys lie Glu Gly 
145 

Asp Glu Asp Asn Asn 

165 

Glu Thr lie Glu Asp 
180 

Val Glu Leu Asp Tyr 
195 

Thr Val Asp Glu Ala 
210 

Trp 
225 



Gin Ser Cys Val Thr Arg 

90 

lie Ala Arg Leu Val Asp 
105 

Thr Phe Thr Arg Lys Phe 
120 

Met Asp Val Cys Ala Thr 
135 

Thr He Val Val Thr Val 
150 155 

Gly Val Met Asp Glu Gly 

170 

Asp He Lys Asp Cys Gly 
185 

Thr Trp Thr Gin Asn Glu 
200 

Glu Asp Thr Pro Ser Glu 
215 



Lys Thr He Phe Phe 
95 

Tyr Arg Pro Gin Glu 
110 

Ser Ser Lys Met Pro 
125 

Arg Asp Ala Asp Asp 
140 

Arg Val Ser Leu Tyr 

160- 

Lys Val He Pro Ser 
175 

Leu Leu Asp Gin Asp 
190 

Cys Asp Leu Pro Asp 
205 

Thr Gly Glu Phe Phe 
220 



<210> 5 
<211> 205 
<212> PRT 

<213> Pholas dactylus 
<400> 5 

Glu Glu Val Gin Cys Ala Met Asn Trp Thr Gin Ala Asn Glu Tyr Val 
15 10 15 

Phe Asn Val Asp Trp Met Thr He Phe He Tyr Asp Tyr Gly Ala Gin 
20 25 30 

Glu Gin Leu Tyr Glu Asp Arg Ala Leu Gly Leu Cys Arg He Glu Arg 
35 40 45 

Ala Gly Pro Gly Thr Thr Lys Ala Val Trp He Asn Trp Ser Asn Asp 



50 



55 



60 



Thr Gin Ser Cys Val Thr Arg Lys Thr lie Phe Phe Glu Val Gly Gly 
65 70 75 80 

Glu lie Ala Arg Leu Val Asp Tyr Arg Pro Gin Glu Asp Gly Thr Glu 

85 90 95 

Lys Thr Phe Thr Arg Lys Phe Ser Ser Lys Met Pro Gly Thr Tyr Met 
100 105 110 

Leu Met Asp Val Cys Ala Thr Arg Asp Ala Asp Asp Lys Cys He Glu 
115 120 125 

Gly Thr He Val Val Thr Val Arg Val Ser Leu Tyr Asp Glu Asp Asn 
130 135 140 

Asn Gly Val Met Asp Glu Gly Lys Val He Pro Ser Glu Thr He Glu 
14 5 150 155 160 

Asp Asp He Lys Asp Cys Gly Leu Leu Asp Gin Asp Val Glu Leu Asp 

165 170 175 

Tyr Thr Trp Thr Gin Asn Glu Cys Asp Leu Pro Asp Thr Val Asp Glu 
180 185 190 

Ala Glu Asp Thr Pro Ser Glu Thr Gly Glu Phe Phe Trp 
195 200 205 



<210> 6 
<211> 225 
<212> PRT 

<213> Pholas dactylus 
<400> 6 

Met Ala Cys He Val Phe Val Ala Leu Val Ala Leu Cys Leu Met Gin 
15 10 15 

Pro Gly Ser Gly Glu Glu Val Gin Cys Ala Met Asn Trp Thr Gin Ala 
20 25 30 

Asn Glu Tyr Val Phe Asn Val Asp Trp Met Thr He Phe He Tyr Asp 
35 40 45 

Tyr Gly Ala Gin Glu Gin Leu Tyr Glu Asp Arg Ala Leu Gly Leu Cys 
50 55 60 

Arg He Glu Arg Ala Gly Pro Gly Thr Thr Lys Ala Val Trp He Asn 



65 



70 



75 



80 



Trp Ser Asn Asp Thr Gin Ser Cys Val Thr Arg Lys Thr lie Phe Phe 

85 90 95 

Glu Val Gly Gly Glu lie Ala Arg Leu Val Asp Tyr Arg Pro Gin Glu 
100 105 110 

Asp Gly Thr Glu Lys Thr Phe Thr Arg Lys Phe Ser Ser Lys Met Pro 
115 120 125 

Gly Thr Tyr Met Leu Met Asp Val Cys Ala Thr Arg Asp Ala Asp Asp 
130 135 140 

Lys Cys He Glu Gly Thr He Val Val Thr Val Arg Val Ser Leu Tyr 
145 150 155 160 

Asp Glu Asp Asn Asn Gly Val Met Asp Glu Gly Lys Val He Pro Ser 
165 170 175 

Glu Thr He Glu Asp Asp He Lys Asp Cys Gly Leu Leu Asp Gin Asp 
180 185 190 

Val Glu Leu Asp Tyr Thr Trp Thr Gin Asn Glu Cys Asp Leu Pro Asp 
195 200 205 

Thr Val Asp Glu Ala Glu Asp Thr Pro Ser Glu Thr Gly Glu Phe Phe 
210 215 220 

Trp 
225 



<210> 7 
<211> 17 
<212> DNA 

<213> Pholas dactylus 
<220> 

<221> modif iecMbase 
<222> (3) 
<223> i 

<400> 7 

acnathttyt tycargt 



<210> 8 



<211> 17 
<212> DNA 

<213> Pholas dactylus 
<220> 

<221> modif ied_base 
<222> (15) 
<223> i 

<400> 8 

cargargarg gnacnga 



<210> 9 
<211> 17 
<212> DNA 

<213> Pholas dactylus 
<220> 

<221> modif ied_base 
<222> (3) 
<223> i 

<400> 9 

tcngtnccyt cytcytg 



<210> 10 
<211> 18 
<212> DNA 

<213> Pholas dactylus 
<220> 

<221> modif ied_base 
<222> (9) 
<223> i 

<400> 10 

ttyaaygtng aytggatg 



<210> 11 
<211> 20 
<212> DNA 
<213> Pholas 



dactylus 



<400> 11 

acacagcccc aaagcccgat 



<210> 12 
<211> 20 
<212> DNA 

<213> Pholas dactylus 
<400> 12 

ttgcccggct agttgactac 



<210> 13 
<211> 24 
<212> DNA 

<213> Pholas dactylus 
<400> 13 

catatttcaa ccagtgttta tt 



<210> 14 
<211> 19 
<212> DNA 

<213> Pholas dactylus 
<400> 14 

caattgtgcc ttcgatgca 



<210> 15 
<211> 17 
<212> DNA 

<213> Pholas dactylus 
<400> 15 

ggactgtggg ctcttag 



<210> 16 
<211> 20 
<212> DNA 
<213> Pholas 



dactylus 



<400> 16 

atggcttgta tcgttttcgt 



20 



<210> 17 
<211> 27 
<212> DNA 

<213> Pholas dactylus 
<400> 17 

ccacacggat cctgaggaag tacaatg 2 7 



<210> 18 
<211> 27 
<212>"DNA 

<213> Pholas dactylus 
<400> 18 

ccacacggat ccttattgat gaggaca 2 7 



<210> 19 
<211> 53 
<212> DNA 

<213> Pholas dactylus 
<400> 19 

cttgttttta tggtcgtcta catttcttac atctatgcgg aggaagtaca atg 53 



<210> 20 
<211> 54 
<212> DNA 

<213> Pholas dactylus 
<400> 20 

ccacacagat ctagaatgaa attcttagtc aacgttgccc ttgtttttat ggtc 54 



<210> 21 
<211> 24 
<212> DNA 
<213> Pholas 



dactylus 



<400> 21 

tttactgttt tcgtaacagt tttg 



<210> 22 
<211> 20 
<212> DNA 
<213> Pholas 



dactylus 



<400> 22 

caacaacgca cagaatctag 



